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Abstract. A Borel system consists of a measurable automorphism of a stan- 
dard Borel space. We consider Borel embeddings and isomorphisms between 
such systems modulo null sets, i.e. sets which have measure zero for every 
invariant probability measure. For every t > we show that in this category 
there exists a unique free Borel system (Y, S) which is strictly t-universal in 
the sense that all invariant measures on Y have entropy < t, and if (X, T) is 
another free system obeying the same entropy condition then X embeds into 
Y off a null set. One gets a strictly t-universal system from mixing shifts of 
finite type of entropy > t by removing the periodic points and "restricting" to 
the part of the system of entropy < t. 

As a consequence, after removing their periodic points the systems in the 
following classes arc completely classified by entropy up to Borel isomorphism 
off null sets: mixing shifts of finite type, mixing positive-recurrent countable 
state Markov chains, mixing sofic shifts, beta shifts, synchronized subshifts, 
and axiom-A diffcomorphisms. In particular any two equal-entropy systems 
from these classes are entropy conjugate in the sense of Buzzi, answering a 
question of Boyle, Buzzi and Gomez. 



1. Introduction 

1.1. Background. Motivated by the conjugacy problem for symbolic covers of 
partially hyperbolic systems, in the late 1990's Buzzi [6] introduced the notion 
of an entropy conjugacy between topological dynamical systems: X, Y are entropy 
conjugate if there is a measurable, shift-commuting bijection X \ Xq — > Y \ Yq, 
where Xq C Y, Yq Q Y are invariant Borel sets which have measure zero for every 
invariant probability measure of sufficiently large entropyQ Entropy conjugacy for 
mixing shifts of finite type (SFTs) of the same entropy follows from classical work of 
Adler and Marcus on almost topological conjugacy [1! , and the so-called subshifts 
of quasi-finite-type shifts (QFTs) were classified up to entropy conjugacy by Buzzi 
[7]; this class is of interest because it includes symbolic covers of many examples, 
e.g piecewise entropy-expanding maps and multidimensional beta-shifts [7]. Finally, 
Boyle, Buzzi and Gomez classified the strong positive recurrent (SPR) countable 
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"'A notion with the same name and similar (but not identical) definition was introduced earlier 
by R. Bowen [3] 
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state Markov shifts up to entropy conjugacy, showing that entropy and period are 
complete invariants. Their proof shows that the entropy conjugacy can be realized 
as an almost isomorphism [3]. 

The purpose of this note is to show that on rather general grounds, a stronger 
equivalence than entropy conjugacy holds for quite a wide class of systems. 

1.2. Borel systems and full sets. Our setting is that of Borel dynamics, whose 
basic definitions we summarize in the next few paragraph; for further detail see 
O . Recall that a standard Borel space (X, J 7 ) is a set X and cr-algebra T 
which arises as the a-algebra of Borel sets for some complete, separable metric on 
X. Up to isomorphism (i.e. measurable bijections) there are only three standard 
Borel spaces: those which are finite, those which are countable, and those which 
are isomorphic to [0, 1] with the Borel cr-algebra. We shall always assume the last 
of these. We suppress T in our notation and assume implicitly that all sets are 
measurable unless otherwise stated. Similarly, all our measures are defined on the 
given cr-algebra. 

A Borel system (X, T) consists of an uncountable standard Borel space X and 
a bijection T : X — > X such that T, T -1 are measurable. T is free if it contains 
no periodic points, i.e. T n x ^ x for all x G X and all n ^ 0. By [15], the set 
of periodic points is a Borel set, and if the complement of the periodic points is 
uncountable then restricting T to it gives rise to a Borel system which we call the 
free part of (X,T). 

Two Borel systems {X, T), (Y, S) are isomorphic if there is a Borel isomorphism 
p : X — >• Y such that tpT = Sp. If instead tp is only a Borel injection and p>T = Sp 
we call it a Borel embedding, and say that (X, T) embeds into ( Y, S) . We note that 
the image of a Borel injection pi : X — > Y is a Borel subset of Y, and the partially 
defined inverse <p \tp<x) is measurable. 

For a Borel system (X, T), let 

£(X,T) — {ergodic T-invariant probability measures on X}. 

We say that a Borel set X C X is universally null if n(X ) = for all fi G £{X, T). 
The universally null sets form a c-ideal, although if £(X,T) = all sets will be 
null. A set is full if its complement is universally null. We say that Borel systems 
(X,T), (Y,S) are Borel isomorphic on full sets if there are invariant, full Borel 
subsets of each on which the restricted maps are isomorphic. We say that (X, T) 
embeds into (Y, S) on a full set if there is a full, invariant set in X which embeds, 
with the restricted actions, into Y. 

The Gurevich entropy [TT] of a Borel system is defined by 

h(X,T) — sup h(T,ii) 

fi.e£(X,T) 
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or — oo if £(X,T) = 0. When the transformation is clear from the context, we 
abbreviate the notation to £ (X), h(X),h(/i) etc. 

1.3. Embedding and universality. There are two obvious obstructions to em- 
bedding on full sets. One is the entropy of invariant measures: clearly one cannot 
hope to embed (X,T) into (Y, S) if h(X) > h(Y). The other obstruction comes 
as usual from periodic points. Every full set includes all periodic points, since a 
periodic point has positive mass with respect to the unique invariant probability 
measure on its orbit. Hence an embedding of X into Y requires that for each k the 
number of periodic points of period k in Y be no less than the corresponding count 
in X. 

Since the periodic points in a Borel system form an invariant Borel set [15] one 
may deal with this set separately; indeed, this part of a system is classified up to 
Borel isomorphism by the cardinality of the set of points of period each k. For this 
reason we shall mostly be interested in free actions, or the free parts of non-free 
actions. 

Definition 1.1. A Borel system (X, T) is t- universal if any free Borel system (Y, S) 
of entropy < t embeds into X on a full set. 

According to this definition (X, T) may have measures of entropy > t and peri- 
odic points. A stronger property is given in the following definition. 

Definition 1.2. A t- universal system which is free and has no ergodic measures of 
entropy > t is strictly t-universal. 

It is convenient to introduce one more definition: 



Definition 1.3. A t-slice of a Borel system A is a Borel subset X t Q X such that 
for (j,e£(X,T), 



A free t-slice is a t-slice of the the free part of X. 

Note that, up to universally null sets, a Borel system has a unique t-slice, and 
we therefore refer to it as the t-slice. Similarly the free t-slice is unique up to null 
sets in the free part of X. We show that i-slices exist in section |4~T1 

Elementary considerations and Cantor-Bernstein type arguments now establish 
the following basic properties of universal and strictly universal systems. 

Proposition 1.4. Let t > 0. 

(1) Any two strictly t-universal systems are isomorphic on full sets. 

(2) The free t-slice of a t-universal system is strictly t-universal. 
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(3) If (X, T) is free, supports no ergodic measures of entropy > t, and contains 
an s-universal subsystem for every s < t, then (X,T) is strictly t-universal. 

1.4. Main results. We next turn to our main results. The next theorem shows 
that t-universal (and hence strictly t-universal) systems exist. See section [2] for 
the definition of mixing shifts of finite type, and section [3] for discussion of generic 
points. 

Theorem 1.5. A mixing shift of finite type (X,T) is h(X) -universal, and for every 
t < h(X), the free t-slice of X is strictly t-universal. 

Note that the Krieger generator theorem states that for any non-atomic invariant 
probability measure /i on a Borel system Y, if X is a mixing SFT and h(fi) < h(X) 
then there is a set of full measure for /x which embeds into X. In fact this is a Borel 
theorem in the sense that, given a free Borel system (Y,S) with h(Y) < h(X), 
the proof of the generator theorem produces a Borel map from a full set in Y 
into X, which, for every pi G £(Y,S), is an injection on a set of /x-measure one 
(a similar observation is made in concerning the Jewett-Krieger theorem). The 
main innovation in the proof of Theorem 11.51 is a coding argument which ensures 
that for distinct ergodic measures /i, v on Y the images of /x, v under the embedding 
are disjoint. 

The systems listed in our next theorem all have the property that they contain 
embedded mixing SFTs of arbitrarily large entropy. The theorem then follows from 
the previous one and Proposition II .41 (3): 

Theorem 1.6. For X in any of the following classes and to < h(X,T) then the 
to- slice of X is to-universal: 

• Mixing sofic shifts, 

• Mixing countable state Markov shifts, 

• Mixing axiom- A diffeomorphisms, 

• Mixing synchronized subshifts, 

• The natural extensions of (3-shifts, 

• Intrinsically ergodic mixing shifts of quasi-finite type. 

If a free t-universal system has a unique measure of maximal entropy then the 
space can be partitioned into a set supporting only this measure and no other 
invariant probability measures, and the remainder, which is strictly t-universal. 
Thus in this case isomorphism of the original system on a full set is determined by 
the isomorphism type (in the sense of ergodic theory) of the measure of maximal 
entropy. In particular, when two such systems have measures of maximal entropy 
which are Bernoulli they are isomorphic on full sets. Using the fact that systems 
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in the classes above, with one exception, support a unique measure of maximal 
entropy which is Bernoulli, we have: 

Theorem 1.7. Let X,Y be any two systems drawn from the classes listed men- 
tioned in Theorem \1.6\ excluding Markov shifts which are not positive recurrent. 
If h{X) = h(Y) then the free parts of X, Y are isomorphic on full sets, and in 
particular X, Y are entropy conjugate. 

Similarly, positive recurrent Markov shifts are classified up to isomorphism on 
full sets by entropy and period; and similarly the class of recurrent non-positive 
recurrent Markov shifts. 

This answers Question 6.5 in [4], showing that the SPR property of Markov shifts 
is not an invariant of entropy-conjugacy, and whether positive recurrent Markov 
shifts of equal entropy and period be entropy-conjugate. 

Quasi- finite- type (QFTs) systems were introduced in [7], where they were shown 
to be Borel isomorphic to the union of finitely many strong positive recurrent 
Markov shifts together with a Borel system of lower entropy [7] . Let us say that a 
dynamical system is entropy-mixing if it has a mixing measure of maximal entropy. 

Theorem 1.8. The free part of entropy-mixing guasi- finite-type subshifts are clas- 
sified up to isomorphism on full subsets by the number and periods of their measures 
of maximal entropy. 

The hypothesis of entropy-mixing is necessary: Consider a positive entropy SFT 
X of period 2, and let X 1 be the union of X with a mixing SFT of lower entropy. 
Then X, X' are QFTs whose (unique) maximal measures are isomorphic, but X 1 
supports a mixing measure while X does not. Therefore they cannot be isomorphic 
on full sets. 

1.5. Open problems. We end this introduction with several questions which arise 
from this work. A natural question is whether the isomorphisms we construct can 
be given any amount of continuity when the original systems carry a topology. Let 
us suggest the following concrete problem, which we have been unable to resolve: 

Problem 1.9. Let X, Y be mixing SFTs on finite alphabets, and h(X) = h(Y). 
Let X', Y' denote the sets obtained by removing all periodic points from X, Y. Is 
there a topological conjugacy between the (non-compact) systems X' and Y'l 

We do not know the answer even if we require only that X', Y' be full sets. Note 
that by the Adler-Marcus theorem on almost topological conjugacy [I], one can find 
a topological isomorphism between subsets of X", Y" which are dense G$'s, contain 
all points with dense orbits, and have measure one for every invariant measure of 
sufficiently large entropy. Also, any closed subshift of X without periodic points 
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can be topologically embedded in Y, and visa versa. However the known methods 
do not seem to extend all the way "down" to the very low complexity part of the 
systems. 

In another direction, one may ask for a purely Borel formulation. While our 
methods produce Borel maps, the sets on which they are defined are "large" only 
from an ergodic-theory point of view. A more natural notion in the Borel category 
is the following one, introduced by Shelah and Weiss [13] , Let (X,T) be a Borel 
system. A Borel set W C A is wandering if its iterates T n W are all disjoint. Let 
W denote the set of countable unions of wandering sets, and all Borel subsets of 
such unions. If X £ W we say that X is purely wandering. If this is not the case 
then W is a cr-ideal, and it is natural to say that a set is Borel-null if it belongs 
to W, and Borel-full if its complement is in W. Thus we arrive at the following 
question: 

Problem 1.10. Let (X,T) be a Borel system, Y a mixing SFT, and suppose that 
h(T,iS) < h(Y) for all ju £ £(X,T). Is there an invariant Borel-full subset I'CI 
such that (X' , T) can be embedded by a Borel map into (Y, S)? 

Like the universally null sets that we have been using, Borel-null sets have a 
characterization in terms of measures: A £ VV is and only if fi(A) = for every 
conservative invariant cr-finite measure fi (this is proved in |14j with "non-singular " 
in place of "invariant" , but from the methods in |14j the version given here follows 
easily). It is also well known that if [i is a cr-finite infinite invariant measure on 
(A, T) and Y is a mixing SFT then there is a set of /i-measure 1 which embeds in 
a Borel way into Y. Thus one might hope that Problem 11.101 can be answered by 
treating each measure individually. However, unlike invariant probability measures, 
for infinite measures it is not possible to partition A into disjoint sets each of which 
supports a single cr-finite measure. Thus it appears that new ideas will be needed 
to answer Problem II. 101 

This problem is essentially equivalent to one posed by Benjamin Weiss in [15] , 
asking whether a Borel action without invariant probability measures always has a 
2-set generator modulo wandering sets. Indeed, given A, Y as in the problem, we 
may embed a full set in A using Theorem 11.5) the complement of this full set has 
no invariant probability measure, and an answer to Weiss' question would probably 
allow one to embed it in Y. 

A positive answer to Problem 1 1 . 101 would lead to a Borel isomorphism, modulo 
wandering sets, between mixing SFTs (and, more generally, SPR Markov shift) of 
the same entropy. Following this work, Boyle, Buzzi and Gomez have shown that 
this conclusion at least is true [5]. 



Isomorphism and embedding of Markov shifts 



7 



In the next two sections we present some background and define generic points 
in our context. In Section [4] we establish some properties of t- universal systems and 
show how to deduce our results from Theorem II .51 In Section [S] we prove Theorem 
Ol 

Acknowledgement. My thanks to Mike Boyle for some very interesting discussions 
and many useful suggestions on the presentation of this paper. This work was 
supported by NSF grant 0901534. 

2. Shifts of finite type 

We briefly recall the definitions of SFTs needed for Theorem ll.61 Let G — (V, E) 
be a directed graph on finite vertex set V. Endow E E with the product topology, 
which is compact and metrizable. The shift of finite type (SFT) defined by G is 
the subshift Xq C E 1 " consisting of bi-infinite paths through the graph, that is 

Xq = {x E E 1 : initial vertex of Xi + \ = terminal vertex of Xi\ 

The shift transformation S : Xq — » Xq is defined by 

(Sx)i = x l+ i 

If the graph G is strongly irreducible, i.e. there is a directed path between every 
pair of vertices, then Xq is said to be irreducible; this is equivalent to the existence 
of a dense forward orbit under the shift. If G is also aperiodic, i.e. there is some 
N so that for every v G V and n > N there is a path from v to itself of length n, 
then Xq is aperiodic in the sense that there is no factor map from Xq to a periodic 
orbit of length > 1. An SFT is topologically mixing if and only if it is irreducible 
and aperiodic. 

Shifts of finite type have a well developed theory. We mention the following: 

Theorem. Let X be a mixing SFT X . Then 

• X has a unique invariant probability measure of maximal entropy which is 
isomorphic (in the ergodic category) to a Bernoulli process. 

• For any ergodic system (Y, T, v) of entropy less than h(X) there is a shift- 
invariant measure on X isomorphic to (Y,T,o~). 

The only fact we shall use about the systems in Theorem 11.61 is that they have 
SFT subsystems of entropy arbitrarily close to the entropy of the system, and in 
most cases (except non positive recurrent countable state Markov shifts) a unique 
measure of maximal entropy which is ergodic-theoretically Bernoulli. For further 
information see [12] (countable state Markov shifts); [13] (sofic shifts and synchro- 
nized systems); [3] (axiom- A diffcomorphisms); [2] (beta shifts); [7] (subshifts of 
quasi finite type). 
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3. Symbolic representation and generic points 

As is often the case in Borel dynamics, it is convenient to introduce a topology 
to our systems. Most convenient for us is the possibility of modeling our systems 
as shift- invariant subsets of shift spaces over countable alphabets. Endow N z with 
the product topology and let T denote the shift map (Tx)i — Xi+i. 

Theorem (Weiss |15j). Any free Borel system can be embedded on a full set into 
(N Z ,T), where T is the shift. 

Next, it will be useful to partition our systems into sets which support a unique 
invariant probability measure. More precisely, we shall describe the set G of generic 
points of N z and the corresponding partition of G according to ergodic probability 
measures. These definitions are standard in topological dynamics but require a 
little care since the space N z in our setting is not compact. 

Let C be the algebra of sets in N z generated by the cylinder sets. We say that 
x G N z has limiting frequencies if every C G C, the limit 

(1) lim ¥ £ l c (S n x) 

n=-N 

exists. The set of points with limiting frequencies is Borel. 

If /j. G £(N Z ) then for /i-a.c. x the limit in ((T|) exists and equals /«(C). We denote 
by G^ the set of points which satisfy this condition for every set C G C. Thus 
n{G^) — 1 and the sets G^. /i G £(N Z ) are pairwise disjoint. 

We claim that G = U Aie £( N z ) G m is Borel. To this end, let Z N = {1, . . . , N + 1} Z 
and let ir^ ■ N z — > Zn denote the factor map defined componentwise by mapping 
the symbol i G N to min{i,iV +1}. Clearly if x has limiting frequencies then 
for every N, the point ttn(x) G Zn satisfies the same condition for every closed 
and open set C C Z^. Then, since Zn is compact, there is a unique invariant 
probability measure fiN on Zjq such that limit (JT]) is equal to /zat(C). 

We say that x G N z is a generic point if it has limiting frequencies, each //jy is 
ergodic, and 

lim ^([JV + 1]) =0 

N— s-oo 

Here [N + 1] is the cylinder set {z G Zn : zq = N + 1}. The set of generic points is 
seen to be Borel. We conclude by showing that the set of generic points is precisely 
G. Indeed, clearly (J G M consists only of generic points. Conversely, if x is generic 
then x G G^ for the measure [i which is the inverse limit of the measures /in. 

4. Embedding and universality 

4.1. Generalities. This section fills in some details which were omitted from Sec- 
tion [OS We first show below that t-slices exist: 



Isomorphism and embedding of Markov shifts 



9 



Lemma 4.1. Let (X,T) be a Borel system. Then (X,T) has a t-slice. 

Proof. We may assume that X C N z and T is the shift. Let G be the Borel set 
of generic points, so that G (~l X is a full set in X. For xeGfll one obtains a 
measure /i^ in a Borel way so that x € G M ^. Since the entropy h(T, pb x ) is a Borel 
function of /U x , we may form the Borel set X t — {t e G : h(S,(i x ) < t}. Clearly 
this is a t-slice. 

We shall need the following to prove Proposition II .41 □ 

Lemma 4.2. Let (X,T) and (Y, S) be Borel systems. Suppose that for each s <t 
there is an embedding (p s : X s >• Y of the s -slice X s C X into Y . Then there is an 
embedding (p : X t Y . 

Proof. Let s n / h(X,T), s n = 1,2,3.... Set = X Sl and jW = X Sn \ X (n \ 
so are disjoint. Clearly X t = \J X^ and, defining tp\xw — <p Sn \xto 7 we have 
ip : Xt Y as desired. □ 

An immediate consequence is: 

Corollary 4.3. If(X,T) is t-universal and {Y,S) is a free system whose invariant 
measures have entropy < t, then Y embeds into X on a full set. 

We can now prove Proposition II .41 

Proof, (of Proposition H~4|) (1) Let (X,T) and (Y, S) be strictly t-universal. Hence 
by the previous lemma there are full invariant sets Xq C X and Yq C Y and Borel 
embeddings <p : Xq Y and ib : Yq X. We note that (p preserves universally 
null sets between Xq and <p(Xq) and similarly for and henceforth wc shall work 
modulo universally null sets. 

We proceed precisely as in the Cantor-Bernstein theorem in the category of sets. 
Form the map r = (ftp and let Y\ = Yq \ <p(Xq) (if this is a universally null set 
we are done, since if is the desired isomorphism). Consider the sets Y n = T n Y\. 
One easily verifies that they are pairwise disjoint, as are X n — ipY n . Let = 
Y o \ U^Li Yn and x oo = X \ U^Li x n- Then ip : X^ -> Y^ is bijective and 
«/> _1 : U^=i-^n — ^ U^Li ^ i s bijective. These sets are complementary up to a 
universally null set, and one verifies that all sets involved are invariant; hence this 
is the desired isomorphism. 

(2) is trivial. 

(3) follows immediately from Lemma T4. 2 1 □ 

4.2. Some proofs. In the next section we prove Theorem II. 51 showing that a 
mixing SFT Y is /i(Y)-universal. Assuming this we have: 
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Proof, (of Theorem ll.6[) . Suppose that (A, T) is a Borel system and that for t < to 
there is a mixing SFT of entropy at least t embedded in A. Since mixing SFTs 
are universal at their entropy, it follows that X is ^-universal for every t < to- By 
Lemma 14. 2\ this implies that X is to-universal , and hence the tg-slice of the free 



Proof, (of Theorem ll.7p . Let (X,T), (Y, S) be two systems of entropy h from 
the list in Theorem II. 7\ excluding the countable Markov chains, and assume we 
have already removed the periodic points from them. Let /i, v denote, respectively, 
the unique measures of maximal entropy on X, Y and let X' = G M C X and 
Y' = G v C Y be the points generic for fi, v. Clearly X' and the /i-slice Xh Q X are 
disjoint (up to null sets), and similarly Y' and Yh- Also /Lt, v are the only invariant 
probability measures on X' , Y' , they have the same entropy as X, Y and therefore 
as each other, and in the classes in question [i, v are Bernoulli. Thus from the 
Ornstein theory there is an isomorphism X' — > Y' defined on a full set (i.e. from 
a sets of //-measure one to a set of ^-measure one). Also, by Theorem 11.61 Xh, Yh 
are Borel isomorphic on full sets. Now the required isomorphism is defined by 
combining the two given isomorphism Xh — > Yh and X' — > Y' , and noting that 
X h U X' and Y h U Y' are full. 

For the class of positive recurrent Markov shifts we proceed as follows. For 
mixing positive-recurrent chains the proof is exactly as above, since there is a unique 
measure of maximal entropy and it is Bernoulli. In the case that (A, T), (Y, S) are 
PR and both have period p > 0, one considers the p-th power of the shift map 
S. Then A, Y split, respectively, into p disjoint T p - and ^-invariant aperiodic PR 
Markov shifts of the same entropy which are permuted cyclically by T, S. Applying 
the above to one pair Xq,Yo we obtain an isomorphism / : (X ,T P ) — > (Yq,S p ) 
defined on full sets X C A and Y ' C Y . Extend / to T l X' Q by f(T*x) = S l f(x). 
This is an isomorphism between the full sets A' = U P =Q T l X and Y' = Uf =0 S' i r o '. 

The case where A, Y are recurrent but not positive recurrent is simpler, since, 
in the mixing case, absent a measure of maximal entropy an /i-slice is already a full 
set, and so the result follows directly from Theorem 1 1.51 and proposition [L4] When 
there is periodicity we pass to an appropriate power of the shift and continue as 
above. □ 

Proof, (of Theorem ll.8p . This follows from Corollary 1], which implies that any 
entropy-mixing QFT shift of entropy h is /i-universal and hence the free /i-slices of 
any two such systems are isomorphic on full sets. As above, the isomorphism can be 
extended to a full set by defining it separately on sets which support the measures 
of maximal entropy, which are products of a Bernoulli and a periodic measure by 



part of A is to-universal. 



□ 



Theorem 1(1)]. 



□ 
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5. Universality of mixing SFTs 

5.1. Assumptions. If (X, T) is a Borel system then by a theorem of Weiss |15) . 
(X,T) may be embedded on a full set into N z as a shift-invariant subset. Thus to 
prove Theorem 1 1.51 it suffices, for t > 0, to prove it for the free part of the f-slice of 
N z . We may focus our efforts on this case. 

For the remainder of this paper we fix the following notation. (Y, S) is a mixing 
SFT on a finite alphabet A and t < h(Y). Let and X = X t be the free part of 
the i-slice of N z . We wish to show that we can embed a full set of X into Y . 
Let Gt — G C\ X t , where G is the set of generic points in N z (see Section [3]) . For 
simplicity of notation we suppress the index t and write G = Gt- No ambiguity 
should arise since the set of all generic points will not be used again. 

5.2. The Krieger generator theorem. The Krieger generator theorem (see e.g. 
[8j) asserts that for every /i £ £{X) one can find a set of ^-measure 1 and an 
embedding of this set into Y (recall that according to our assumptions, h(fi) < 
t < h(Y) and there are no periodic points in X). There are two facts about this 
theorem which we shall need. One is that it is a Borel theorem, in the sense 
that the embedding obtained is a Borel function of the data; see e.g. [9]. More 
importantly, for each /i £ £(X) the inverse map of the embedding is finitary (see 
below) . This feature of the theorem follows easily from most of the standard proofs 
of the generator theorem; we provide a precise statement below. 

Recall that, given a shift-invariant probability measure v on N z , a finitary factor 
map 7r into Y is a factor map defined on a set of ^-measure 1 with the property 
that for v-&.e. x, the symbol tt(x)o is determined by a finite block a;|[_ n „], where 
n = n(x) is a measurable function of x. Such a map can be represented as a 
countable sequence of pairs (a, 6), with a g {j n A^~ n ' n ^ and b G A, consisting of 
all pairs such that 7r(a;)o = b if ir|r_ njn i = a. The space of such sequences of pairs 
(whether they give a well-defined factor map of v or not) is a Borel space with the 
obvious structure. We can further represent such a sequence of pairs as a sequence 
of 0's and l's in an appropriate coding. 

We are now ready to state the version of the Krieger generator theorem which 
we shall need. 

Theorem 5.1. Let t, X t C N z and Y be as above. There exist Borel maps (p : 
X t — > Y and xj} : Xt —> {0, 1} N such that, for every \i £ £(X t ). the map <p is an 
injective factor map X t n G M — > Y, and its inverse is (x-a.s. given by the finitary 
map encoded by ijj(x). 

This folklore version apparently has not been stated in the literature but it 
follows from the standard proofs (e.g. [3 Theorem 28.1]), except that one needs to 
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construct Rohlin towers simultaneously for all measures in a measurable way. This 
can be achieved using Proposition 7.9 of [10] . 

It follows that the map x i— > (ip(x), ip( x )) 1S injective on a full set in X — X t . Our 
task, therefore, will be to construct an embedding ip' : X — y Y which encodes both 
tp(x) and ij)(x) into ip'{x), that is, if' is shift commuting, defined on a full set, and 
from tp'(x) we can recover tp(x) and ip( x )- We accomplish this by partitioning the 
output sequence y — <p'(x) into two subsequences, one a low density subsequence 
which encodes ip{x), and the complementary subsequence which encodes (p(x). This 
is where we use the fact that h(T,/j,) < t < h(Y,S) for all fi 6 £(X): this means 
that encoding x requires less than t bits per symbol of input, while in Y we have 
more than t bits available per symbol. Thus we can aside a small positive-density 
subsequence and still be able to encode <p(x) in the complement. 

5.3. Gaps and markers. Let A be the alphabet on which the subshift Y is defined. 
For a block w E [j A", let Y w denote the set obtained from Y by removing all points 
containing w, which is again an SFT. 

A marker in Y is a block w which is admissible for Y (that is, appears as a 
sub-block in some y € Y) and such that no two occurrences of w overlap. As is 
well known, for every e > one can find arbitrarily long markers in Y with the 
property that h(Y w ) > h(Y) — s, and such that Y w is a mixing SFT. 

Relying on this, we proceed as follows: 

• Choose a marker w in Y such that the SFT 

Y' — Y 

is mixing, and 

ti = h{Y w ) > t 

Also, we assume that w is long enough that every symbol in A appears in 
Y'; this is guaranteed if w is chosen so that h(Y w ) > h(Y a ) for all a £ A. 

Write 

t' = \{h' + t) 

Since Y' is a mixing SFT there exists a transition length M > 1 such that for 
all admissible blocks a, b in Y' and all m > M there is a block v 6 A™ 1 such that 
avb is admissible. Since h(Y') > 0, clearly Y 1 consists of more than one point, so 
we may 

• Choose two distinct blocks u, v of length M in Y' . 

5.4. The domain of the map x i-> y. In what follows we will be given a point 
x G G, and we shall define a point y = f'{x). Our choices will depend on a; in a 
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Borcl way. For C E C we denote by /tx(C) the value of the limit 

4 EM*"*) 



N 

lim 

AT-S-OO N 

n=l 



and by definition of G, the function /x is (the restriction to C of) an ergodic invariant 
measure on X. We denote this measure also by \i. 

Let * be a symbol which does not belong to A; to construct y, we start out with 
y = ...*****..., and replace the *'s in several steps until eventually all are 
removed. 

5.5. Marker structure. Wc first insert a sequence of io's in y whose purpose is 
to partition y into identifiable blocks consisting of the gaps between the w's. 

• Choose a block a e N* such that //([a]) > and such that the minimal 
distance between appearances of a in a; is at least N, where 

N > j^— t ■ (e(w) + AM + 1) 

Such a block a exists because x is not a periodic point. To make the choice 
Borel and dependent only on /z, select a to be the first such block with 
respect to the lexicographical ordering on N*. 

• Let 

I\ = {i € Z : a appears in x at index i] 

• Fori eh, set y { . . . y i+ e( w )-i = w. 

Since N > £(w), the blocks from i to i + i{w) — 1 do not overlap, so the last step 
produces a well defined sequence of w's in y. 

Note that the block a was chosen in a manner which depends only on the set 
{C E C : n{C) > 0}. Therefore if x' is any other point giving rise to the same 
function /x(-), then the same block a will be chosen, so the choice is consistent on 

5.6. Coding x. Recall that, given a set A C N z , for z 6 A the first return time of 
z to A is 

r A (z) = min{n > : S n z E A} 
with ta(z) = oo if S n z £ A for all n > 0. The induced map Sa '■ A — > A is defined 

by 

S A z = S rA(z) z 

whenever Ta{z) < oo. If fi,(A) > then by the Poincare recurrence theorem, all 
powers (positive and negative) of Sa are defined at ix-a.e. point in A, and Sa 
preserves the measure 

M0 = ^M-nA) 
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The entropy of (A, /j,a, Sa) is given by Abramov's formula, 

h(S A ,H) = —}-rrh{S,n) 

Let a be the block chosen above, let 

A = N Z \ (J S- n [a] 

n=-M 

and let Sa denote the corresponding induced map. 

Let P denote the partition on N z according to the symbols at coordinates 
0, . . . , £(w) + AM symbols of a sequence, and order the elements of P in some 
fixed way as P = {P\, P2, ■ ■ • }• It is easy to see that P is a generating partition for 
Sa, i-e. for x G A the sequence n(i), defined by S % A x G P n m, determines x. This 
sequence (rij) = (n,-(a;)) is called the itinerary of x. 

Let x' G N z denote the itinerary of x under Sa with respect to the partition P. 
Let v denote the measure on N z obtained from ha by pushing it forward through 
x M> x', so that (N z , S, v) = (N z , ^|^, Sa), and note that the map x M> .t' is jU^-a.e. 
an injection (and by definition measure-preserving w.r.t. v). 

The block a was chosen in such a way that the frequency of occurrences of a (i.e. 
the density of I\) does not exceed 1/N and therefore n([a]) < l/N. Therefore, 

so by Abramov's formula, 

M5 ' V) ~ ^4) ^ ^ 1-W«,)+4M)/JV' < 
Let </3, ^ be the maps in Theorem 15.11 with respect to h' and the SFT Y', that is, a 
factor map of the free part of the /i'-slice of N z into Y'. 

• Let y' = ip(x'). 

• Let 

I 2 = {ieZ : S l x e A} 

and let n(i) £ Z be the unique integer such that Sa % — S l x (in other 
words, n(i) is the cardinality of the set I2 H (0, i) or 7 2 D [«, 0) (depending 
on the sign of i). 

• For i G / 2 let y 4 = (y') n (i)- 

It is not hard to verify that, since y> commutes with S, this definition of y commutes 
with S. Also, note that A was defined so that if i £ Ii and j G I2 then either 
j<i — M or j>i + £(w) + 3M. Therefore the definition above does not conflict 
with the previous one. 
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5.7. Coding tp(x'). Note that we have determined y except for a block of M 
symbols preceding each w, and a block of 3M symbols following each w. We next 
wish to encode "4>(x') into the remaining space, specifically into the block of length 
M starting M symbols to the right of each w. Recall that tp(x') is an infinite 
sequence (ovO^Li, cr„ G {0, 1}. Note that ip is a shift-invariant function. 

We encode (<r n ) in y by controlling the frequency of a certain block (this is one of 
many possible methods). We rely on the following elementary fact: If \r n — 2~™ 2 1 < 
2 - " 2 and / = Y^=i a n r n is given, then we can recover a n from / (instead of 
2~" 2 we could choose any other rapidly decreasing sequence). Recall that u, v were 
chosen above to be blocks of length M in Y' . 

• For n = 1, 2, 3 . . . choose pairwise disjoint U n G C with U n C [a] and such 



Such a family exists as long as (j, arises from a non-atomic and regular 
measure. To make the choice Borel, for each n choose U n to be the first 
element of C such that U n C [a] \ Ufc<„£4 and which satisfies the inequality 
above. 

• For i e h, 

— If T % x G U^Li an d a n = 1 then set the block in y starting at 
i + £(w) + 2M to u, 

— Otherwise, set this block to v. 

We have now determined y except at blocks of * of length M which occur immedi- 
ately before and after a w, and another such block beginning at offset 2M to the 
right of each w. From the definition of M, we can now eliminate the *'s: 

• Fill in each block of *'s with an admissible block from Y' chosen in a manner 
which depends only on the symbols adjacent to the block. 

Notice that, since w does not appear in Y' and u, v are admissible for Y', the blocks 
in y which make up the complement of the original w-blocks do not contain any 
to's. Since w is a marker, it follows that the only occurrences of w in y are those 
which we created intentionally, i.e. those starting at indices i G Ii 

5.8. Decoding. The procedure which produced y is Borel, shift-invariant and de- 
fined /z-a.e. for every /j, G £(X,T). It remains to explain why it is invertible and 
why the images of different ergodic measures do not intersect. 

Suppose y is given. First, note that w, u, v were chosen independently of fi and 

x. 



that 
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Since w is a marker we can identify occurences of it in y, and these occurences 
are precisely those at indices i £ I\, i.e. those whose starting index occurs at times 
when x visits [a]. We thus recover I\, and its density, which is M([ a D- 

Having found 1\ we now look at the density of u's which begin at indices i + 
l(w) + M, i E I\ (note that there may also be it's which appear elsewhere). Let p 
be this density. Then by construction we have 

P \ - KUn) 
MM) ^ %([<*]) 
so from p/n([a\) we can recover (a n ), and hence obtain ip(x'). 
Finally, having found I\ we can recover I 2 because 

h = Z\\J[i-M,i + £(w) + 3M] 
ieii 

From y\i 2 we reconstruct y' = f(x'). Since we know ip(x'), we apply it and recover 
x', and from x' and I 2 we recover x. 
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